SPECIFICATION 



TO ALL WHOM IT MAY CONCERN: 

BE IT KNOWN THAT I, HIROBUMI NISHIDA , a 
citizen of Japan residing at Kanagawa, Japan has 
invented certain new and useful improvements in 

IMAGE PROCESSING DEVICE". ESTIMATING BLACK CHARACTER 
COLOR AND GROUND COLOR ACCORDING TO CHARACTER- ARE A 
PIXELS CLASSIFIED INTO TWO CLASSES 



of which the following is a specification:- 



TITLE OF THE INVENTION 

IMAGE PROCESSING DEVICE ESTIMATING BLACK 
CHARACTER COLOR AND GROUND COLOR ACCORDING TO CHARACTER- 
AREA PIXELS CLASSIFIED INTO TWO CLASSES 
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BACKGRQUND OF THE INVENTION 

1. Field of the Invention 

The present invention generally relates to an 
image processing device, an image processing program, 
5 and a storage medium, and more particularly, to an image 
processing device and an image processing program which 
can adjust a contrast of a black character and a ground 
in an output image, and a storage medium storing the 
image processing program. 

10 2. Description of the Related Art 

For example, when an image output apparatus 
deals with an image (a document image) obtained by an 
image input apparatus, such as a scanner, scanning a 
printed document intended to communicate contents 

15 thereof, the document image is also intended to 

communicate contents thereof. Therefore, enhancing 
readability of a text in the output document image is a 
most important concern upon outputting the document 
image . 

20 However, for example, when a color image 

output apparatus, such as a color printer, prints a 
digital image obtained by using a color image input 
apparatus, such as a color scanner, to input a document 
printed with a white ground, printing the image without 

25 applying any image processing thereto may cause problems 



of lowering a contrast between a character and the 
ground, or coloring a black character or the ground 
which should originally be black or white, thereby 
decreasing readability of the text. 

These problems occur because colors and 
gradations are not coordinated between the image input 
apparatus and the image output apparatus. When image- 
processing apparatuses, such as the image input 
apparatus and the image output apparatus, are stand- 
alone, image-processing algorithms and parameters are 
often designed according to characteristics of 
individual apparatuses . 

There is a related technology, for example, in 
which not only a process of emphasizing a pixel 
corresponding to a black character used in a copier, etc. 
(for example, disclosed in Japanese Patent No. 2558915 
(Japanese Laid-Open Patent Application No. 3-274862), 
Japanese Laid-Open Patent Application No. 2000-196871, 
Japanese Laid-Open Patent Application No. 2000-316097, 
Japanese Laid-Open Patent Application No. 2001-078036, 
and Japanese Laid-Open Patent Application No. 2001- 
169133), and a process of specifying a middle-tone area 
(for example, disclosed in Japanese Patent No. 3158498 
(Japanese Laid-Open Patent Application No. 5-014724), 
Japanese Laid-Open Patent Application No. 2001-036748, 
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Japanese Laid-Open Patent Application No. 2001-144962, 
and Japanese Laid-Open Patent Application No. 11-220631) 
are performed, but also a process of performing a tone 
correction according to a histogram of a pixel signal 
5 value (for example, disclosed in Japanese Laid-Open 

Patent Application No. 2000-013616, Japanese Laid-Open 
Patent Application No. 2000-013625, Japanese Laid-Open 
Patent Application No. 2000-115538, Japanese Laid-Open 
Patent Application No. 2000-242777, Japanese Laid-Open 

10 Patent Application No. 2001-045303, Japanese Laid-Open 
Patent Application No. 2001-148785, Japanese Laid-Open 
Patent Application No. 2001-167260, Japanese Laid-Open 
Patent Application No. 2001-189862, Japanese Laid-Open 
Patent Application No. 2001-189863, Japanese Laid-Open 

15 Patent Application No. 2001-197312, Japanese Laid-Open 
Patent Application No. 2001-222711, and Japanese Laid- 
Open Patent Application No. 10-283470) is performed. In 
such a technology, image-processing algorithms and 
parameters usually depend greatly on color 

20 characteristics, resolutions, and frequency 
characteristics of the input apparatus. 

Besides, when a digital image obtained by 
inputting a printed document from a color image 
apparatus, such as a scanner, is printed from a color 

25 printer, or is displayed on a display, without applying 



any image processing thereto, this may cause a 
problematic phenomenon of a non-uniform color of a 
ground or a background, depending on different 
characteristics of the scanner (i.e., difference in 
reading conditions) upon inputting the printed document, 
or may cause a problematic phenomenon of a so-called 
"show-through effect" in which contents printed on a 
backside of the printed document are transparently input. 
In such cases, the image as a whole often appears 
uncleanly. 

For solving the above-mentioned problematic 
phenomena in a case where the color of the ground or the 
background of the subject digital image is white, it is 
effective to perform a process referred to as "ground 
removal" or "ground-color removal" which replaces 
portions of the ground or the background with white. 

Well-known examples of the process referred to 
as "ground removal" or "ground-color removal" include a 
process of detecting a ground level according to a 
histogram of a pixel signal value, and thereby removing 
a ground (for example, disclosed in Japanese Laid-Open 
Patent Application No. 2000-022971 and Japanese Laid- 
Open Patent Application No. 2000-078408). When the 
color of the ground or the background is white, such 
processing prevents an image deterioration resulting 



from different characteristics of the scanner (i.e., 
difference in reading conditions) or from a "show- 
through effect". 

Besides, there is a process of estimating a 
background color of a document, and using a difference 
between the background color and a pixel value of a 
targeted pixel to change the pixel value of the targeted 
pixel (for example, disclosed in Japanese Laid-Open 
Patent Application No. 2000-050083) . In this process, a 
background color is determined by finding a brightest 
area from histograms of a same color; a background 
reference color is determined from a color distribution 
according to a result of discrimination of either of the 
following four patterns as a background color; and a 
color of each pixel is adjusted according to a 
difference between the above-determined reference color 
and the pixel. In this technology, the . four patterns 
discriminated as a background color are: (1) nearwhite, 
contone (white-ground copy paper, newspaper printing 
paper, etc.), (2) near-white, halftone (magazine 
printing paper etc.), (3) far-white, contone (photograph, 
colored paper, etc.), and (4) a far-white, halftone 
(magazine printing paper etc.). . 

Besides, there is a technology in which a 
threshold value for judging an area to be either an 
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image area, a ground area or other area is determined by 
analyzing a distribution state of peaks with respect to 
a histogram of a gradation distribution; accordingly, an 
image area is output without any processing, a ground 
5 area is converted into a ground color, and the other 

area is processed according to predetermined conditions, 
thereby preventing an image deterioration (for example, 
disclosed in Japanese Laid-Open Patent Application No. 
2001-045297) . 

10 By the way, the recent spread of network 

environments has caused circumstances in which an image 
input via an image apparatus is transmitted to a remote 
place via a network, and a receiver edits, reuses, 
prints and/or retransmits the image on a PC (personal 

15 computer) In such a network environment, processings, 
such as edition, reuse, printing, and retransmission are 
performed to an image input via an unknown image 
apparatus; this involves new technical problems not 
considered in the conventional technology as disclose in 

20 the above-mentioned Japanese Laid-Open Patent 
Application documents. 

That is, in a network environment, 
input/output apparatuses have various characteristics; 
further, when a digital image is input and transmitted 

25 from a remote place, an input apparatus is sometimes 
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unknown. In addition, an image-processing system 
optimal for a certain specific model may not necessarily 
operate effectively to a model having different 
characteristics. Therefore, when an image input by an 
5 image input apparatus is output by an image output 

apparatus, precision in recognizing an image may vary, 
thereby deteriorating a quality of an output image, and 
decreasing a readability of a text in the output image. 

As a countermeasure therefor, an adaptive 

10 binarization is effective, as long as problems of the 
contrast between a character and a ground and coloring 
of a black character and a ground are concerned. 
Research and development of binarization technology have 
been well conducted as a function of preprocessing for 

15 an OCR in a document image processing. However, 
applying a binarization causes a problem that a 
gradation and a color in a middle-tone area are lost. 

This problem may be solved by using a 
technique of dividing an image into areas, such as a 

20 text, a photograph, and a line drawing, which is used in 
a document image recognition, so as to apply the 
binarization only for an area in which a black character 
is written on a white ground. With such a technology, 
even when an area is incorrectly recognized, edition and 

25 correction are easy in OCR whose output forms are text 
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files, such as HTML. However, when a high-definition 
re-output of a document image is intended, edition and 
correction of an image may become very complicated when 
an area is incorrectly recognized. 
5 This necessitates a comprehensive tone 

correction suitable for a document image, namely a 
"soft" comprehensive threshold processing, such as 
changing a white ground into white and changing a black 
character into black while maintaining a gradation of a 

10 middle-tone area. At this point, the comprehensive tone 
correction, i.e., the "soft" comprehensive threshold 
processing means a processing such as applying an 
identical conversion to all of pixels. 

Besides, since image input apparatuses have 

15 various characteristics, hard-coding of a parameter used 
as a "predetermined threshold value" must be avoided, 
for example when performing a processing of substituting 
white for a color of a pixel having a saturation equal 
to or smaller than the predetermined threshold value, or 

20 when performing a processing of regarding a color of a 
pixel having a saturation equal to or smaller than the 
predetermined threshold value as monochrome. Preferably, 
the parameter (the color of a ground or a black 
character) determining the above-mentioned processing is 

25 calculable adaptively from an image. 
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Especially, image output apparatuses, such as 
printers, have various capabilities in representing a 
highlight color; there is not a uniformity; for example, 
a same signal is clipped to become white (not printed at 
5 all) in one case, and is printed visibly in another case. 
Therefore, upon the above-described image processing, it 
is preferred that a user can easily adjust the 
processing according to characteristics of the image 
output apparatuses . 

10 Besides, in a color document image, a ground 

or a background may often have arbitrarily set colors, 
and a background may often have a complicated 
composition, including a plurality of background colors. 
A histogram obtained by simply examining 

15 colors and a gradation distribution in such a color 
document image including a complicated background 
composition has a form of "mixture distribution" in 
which multiple peaks and valleys appear in the histogram. 
Therefore, it is difficult to conjecture a true 

20 background color. 

The above-described technology disclosed in 
Japanese Laid-Open Patent Application No. 2000-022971, 
Japanese Laid-Open Patent Application No. 2000-078408, 
or Japanese Laid-Open Patent Application No. 2001-045297 

25 refers to a method of extracting a background or a 
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ground color from a color document image having such a 
"mixture distribution" . However, since an actually 
obtained histogram also includes many peaks and valleys 
deriving from noises, a desired background or ground 
5 color cannot always be extracted correctly, but may 
often be extracted incorrectly. 

Besides, in the above-described technology 
disclosed in Japanese Laid-Open Patent Application No. 
2000-050083, a background or a ground is discriminated 

10 to be contone or halftone (dots). However, since the 

discrimination of the halftone dots strongly depends on 
a frequency characteristic (MTF) of an input apparatus, 
it is difficult to apply this technology when data is 
transmitted from a remote place via a network with an 

15 input apparatus being unknown. 

SUMMARY OF THE INVENTION 

It is a general object of the present 
invention to provide an improved and useful image 
20 processing device, an image processing program and a 

storage medium in which the above-mentioned problems are 
eliminated . 

A more specific object of the present 
invention is to provide an image processing device and 
25 an image processing program which can not only maintain 
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a gradation of a middle-tone area in an original image 
input from various image input apparatuses, but also can 
automatically adjust a contrast between a black 
character and a ground in an output image without a user 
5 performing an operation, such as setting a parameter, 
and to provide a storage medium storing the image 
processing program. 

In order to achieve the above-mentioned 
objects, there is provided according to one aspect of 

10 the present invention an image processing device 

including a character area extraction part extracting a 
character area from an original image that is a digital 
image, a class classification part classifying pixels 
belonging to the character area into a first class and a 

15 second class according to colors, a black-character- 
color/ground-color estimation part estimating a black 
character color and a ground color on the original image 
according to the pixels belonging to the character area 
being classified into the first class and the second 

20 class, and a tone correction part performing a tone 
correction to the original image according to the 
estimated black character color and the estimated ground 
color. 

According to the present invention, upon 
25 outputting the original image, which is a digital image 
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input from an image input apparatus, from an image 
output apparatus, for example, the black character color 
and the ground color on the original image are estimated 
according to the class classification of the pixels 
5 belonging to the character area according to colors, and 
the tone correction can be performed to the original 
image according to the estimated black character color 
and the estimated ground color without using 
predetermined parameters. Therefore, while a gradation 

10 of a middle-tone area in the original image input from 
the various image input apparatus is maintained, a 
contrast between the black character and the ground in 
an output image can be automatically adjusted, without a 
user performing an operation, such as setting a 

15 parameter. 

In order to achieve the above-mentioned 
objects, there is also provided according to another 
aspect of the present invention an image processing 
device including a character area extraction part 

20 extracting a character area from an original image that 
is a digital image, a class classification part 
classifying pixels belonging to the character area into 
a first class and a second class according to colors, a 
background color estimation part estimating a background 

25 color on the original image according to the pixels 
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belonging to the character area being classified into 
the first class and the second class, a background area 
specification part specifying a background area on the 
original image according to the estimated background 
5 color, and a tone correction part performing a tone 

correction to the original image by replacing a color of 
the specified background area with the estimated . 
background color or a white color. 

According to the present invention, upon 

10 outputting the original image, which is a digital image 
input from an image input apparatus, from an image 
output apparatus, for example, the background color is 
estimated from original image according to the class 
classification of the pixels belonging to the character 

15 area according to colors, and the background area is 

specified according to the estimated background color; 
then, the color of the specified background area is 
replaced with the estimated background color or a white 
color. Thereby, the tone correction can be performed to 

20 the original image without using predetermined 

parameters. More specifically, an image deterioration 
originating from causes in a background part, such as 
changes of color of the background or a show-through 
effect, in the original image input from the various 

25 image input apparatus is corrected automatically without 
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using predetermined parameters, or without a user 
performing an operation, such as setting a parameter, so 
that the background part is made more appropriate, 
thereby obtaining an output image containing easily 
5 viewable characters and so forth. 

Other objects, features and advantages of the 
present invention will become more apparent from the 
following detailed description when read in conjunction 
with the accompanying drawings. 

10 

BRIEF DESCRIPTION OF THE DRAWINGS 

FIG.l is a diagram showing an example of a 
configuration of a system according to a first 
embodiment of the present invention; 
15 FIG. 2 is a diagram showing a module structure 

of a personal computer shown in FIG.l; 

FIG. 3 is a functional block diagram 
illustrating parts operated by the personal computer 
according to an image processing program in the first 
20 embodiment of the present invention; 

FIG. 4 is a flowchart showing a procedure of 
operating the parts included in the functional block 
diagram shown in FIG. 3; 

FIG. 5 shows an example of an input image in 
25 the first embodiment of the. present invention; 
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FIG.6 shows a character area extracted from 
the input image shown in FIG. 5, and blocks divided in 
the input image shown in FIG. 5; 

FIG.7 shows a window selected from the input 
5 image divided into the blocks shown in FIG. 6, and groups 
of pixels classified in the selected window; 

FIG. 8 is a graph used for explaining a tone 
correction according to statistics of a black character 
color and a ground color in the first embodiment of the 
10 present invention; 

FIG. 9 shows an image obtained by performing 
the tone correction of the first embodiment of the 
present invention to the input image shown in FIG. 5; 

FIG.10 is a functional block diagram 
15 illustrating parts operated by the personal computer 
according to an image processing program in a second 
embodiment of the present invention; 

FIG. 11 is a flowchart^ showing a procedure of 
operating the parts included in the functional block 
20 diagram shown in FIG. 10; 

FIG. 12 shows an example of an input image in 
the second embodiment of the "present invention; 

FIG. 13 shows pixels extracted as a background 
from the input image shown in FIG. 12; 
25 FIG. 14 is a graph used for explaining a tone 
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correction according to statistics of a background color 
in the second embodiment of the present invention; 

FIG. 15 shows an image obtained by performing 
the tone correction of the second embodiment of the 
5 present invention to the input image shown in FIG. 12; 

FIG. 16 is a graph used for explaining a tone 
correction according to statistics of a background color 
in a seventh embodiment of the present invention; and 

FIG. 17 shows an image obtained by performing 
10 the tone correction of the seventh embodiment of the 
present invention to the input image shown in FIG. 12. 

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 
A description will now be given, with 

15 reference to the drawings, of embodiments according to 
the present invention. 

First, a description will be given, with 
reference to FIG.l to FIG. 9, of a first embodiment 
according to the present invention. 

20 FIG.l is a diagram showing an example of a 

configuration of a system according to the present 
embodiment. As shown in FIG.l, in a server /client 
system 11 as the system according to the present 
embodiment, a plurality of personal computers. 301 as 

25 image processing devices are connected to a server 
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computer 101 via a network 201, such as a LAN. 

The server/client system 11 is provided with 
an environment in which an image input apparatus 401, 
such as a scanner or a digital camera, and an image 
output apparatus 501, such as a printer, can be shared 
on the network 201. A multi-function peripheral 
(hereinafter referred to as MFP) 601 is connected to the 
network 201. The server/client system 11 may be 
provided with an environment in which the MFP 601 
functions as the image input apparatus 401 or the image 
output apparatus 501. 

The server/client system 11 is capable of 
communicating data with another server/client system 31 
via an intranet 21, for example, and is capable of 
communicating data with an external environment via an 
Internet communication network 41. 

Next, a description will be given, with 
reference to FIG. 2, of a module structure of the 
personal computer 301. FIG. 2 is a diagram showing the 
module structure of the personal computer 301 according 
to the present embodiment. As shown in FIG. 2, the 
personal computer 301 comprises a CPU 302 processing 
information, a primary storage 305, such as a ROM 303 
and a RAM 304, storing information, a secondary storage 
307, such as an HDD (hard disk drive) 306, a removable 
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disk device 308, a network interface 309 for 
communicating information with other .external computers, 
a display 310 displaying a processing progress, a 
processing result and so forth to an operator, a 
5 keyboard 311 for an operator to input a command, 

information and so forth into the personal computer 301, 
and a pointing device 312, such as a mouse. The 
removable disk device 308 reads information stored in 
media 308a that has a portability such as for obtaining 

10 information from outside, keeping information, and 
distributing information to outside. 

Besides, in the present embodiment, a CD-ROM 
is used as the portable media 308a, and the removable 
disk device 308 is realized as a CD-ROM drive capable of 

15 reading information stored in the CD-ROM. 

A bus controller 313 mediates transmission and 
reception of data among the above-described parts 
composing the personal computer 301 including the CPU 
302, the primary storage 305, the secondary storage 307, 

20 the removable disk device 308, the network interface 309, 
the display 310, the keyboard 311 and the pointing 
device 312. Each of the above-described parts composing 
the personal computer 301 operates according to data and 
so forth mediated by the bus controller 313. 

25 In the personal computer 301, when a user 



-20- 



turns a power supply on, the CPU 302 starts a program 
referred to as a loader stored in the ROM 303, reads a 
program managing hardware and software of the computer 
from the HDD 306 to the RAM 304, and starts the read 
5 program. In the present embodiment, the program read 
from the HDD 306 to the RAM 304 for managing hardware 
and software of the computer is referred to as an 
operating system. According to operations performed by 
a user, the operating system starts an application 

10 program and so forth, reads information, and saves 

information. Windows (TM) , UNIX (TM) , and so forth are 
well known as representing the operating system. 
Besides, in the present embodiment, the application 
program is an operational program executed on the 

15 operating system. 

The personal computer 301 according to the 
present embodiment stores an image processing program in 
the HDD 306 as an application program. Thus, in the 
present embodiment, a storage medium storing the image 

20 processing program is realized as the HDD 306. 

Various operational programs, such as the 
image processing program, may be installed, for example, 
from optical information recording media, such as a CD- 
ROM, or magnetic media, such as an FD, which stores 

25 various operational programs, to the secondary storage 



307, such as the HDD 306. Thus, the storage medium 
storing the image processing program is realized also as 
various portable storage media, such as the optical 
information recording media, such as a CD-ROM, and the 
magnetic media, such as an FD. 

Besides, various operational programs, such as 
the image processing program, may be installed, for 
example, from outside via the network interface 309 to 
the secondary storage 307, such as the HDD 306. 

Additionally, in the present embodiment, an 
image processing described hereinafter is performed 
according to the image processing program installed in 
the secondary storage 307, such as the HDD 306. However, 
not limited thereto, the hereinafter-described image 
processing may be performed according to the image 
processing program downloaded from outside via the 
network interface 309 to the secondary storage 307. 

Next, a description will be given, with 
reference to FIG. 3 to FIG. 8, of parts operated by the 
personal computer 301. FIG. 3 is a functional block 
diagram illustrating the parts operated by the personal 
computer 301 according to the image processing program. 
FIG. 4 is a flowchart showing a procedure of operating 
the parts included in the functional block diagram shown 
in FIG. 3. 
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The personal computer 301 starts the image 
processing program executed on the operating system, and 
performs various processings by the CPU 302 according to 
the image processing program so as to control each of 
5 the parts composing the personal computer 301 integrally. 
Upon executing the image processing program, the 
personal computer 301 performs various steps shown in 
the flowchart of FIG. 4 by the parts (functions) shown in 
the functional block diagram of FIG. 3. 

10 <1. Outline of Processing> 

First, a description will be given of an 
outline of the processing. Many characters are printed 
on a document image, and a common document includes a 
part in which a black character is printed directly on a 

15 space including no print. Thus, when extracting an area 
likely to have a black character from an input image, 
and dividing this input image into sufficiently small 
blocks, a certain block including a black character 
printed directly on a space assumably exists. 

20 Accordingly, supposing that a color of the space, i.e., 
a ground color, is white, the procedure of the image 
processing can be specified as follows. 

As shown in the flowchart of FIG. 4, in 
performing the image processing program according to the 

25 present embodiment, first, the personal computer 301 
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receives an original image I 0 (input image) via the 
network interface 309 (step SI) . In the present 
embodiment, the original image Io is a digitized color 
image, for example. 
5 A low-resolution image generation part 1001 

generates a low-resolution image (I) having a lower 
resolution than the original image (I 0 ) from the 
original image (Io) (S2) . 

A low resolution processing part 2000 smoothes 

10 the generated low-resolution image (I) by a smoothing 

part 2001 (S3) , and sets a window of a fixed size around 
each pixel of the low-resolution image (I) , and 
calculates an average value |i and a standard deviation a 
of signals for each of channels of R, G and B by a 

15 feature-value calculation part 2002 so as to calculate 
feature values of the low-resolution image (I) (S4). 

A character area extraction part 3001 performs 
a local adaptive binarization of each color component by 
performing a local adaptive threshold processing and an 

20 expansion processing to the low-resolution image (I) so 
as to perform an extractive detection of a character 
area C (shown in FIG. 6) (S5) . 

A black-character-color/ground-color 
statistical estimation part 4001 divides the input 

25 original image (I 0 ) into fixed-size blocks not 
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overlapping one another (S6) , and classifies pixels 
belonging to the character area C into two classes, i.e., 
a first class and a second class, according to two 
representative colors in each of the divided blocks (S7). 
5 Thus, functions of a block division part and a class 
classification part are performed. According to 
brightness of. the pixels belonging to the character area 
C, a brighter color is usually related to a background 
color of the character area, and a darker color is 

10 usually related to a character color. 

Besides, the black-character-color /ground- 
color statistical estimation part 4001 selects a block B 
including a maximum number of pixels classified into 
either of the classes as a window W from the blocks in 

15 which the pixels belonging to the character area C are 
classified into the two classes according to the two 
representative colors, sets the two representative 
colors in the window W as an average color of a ground 
and an average color of black characters in the input 

20 image, respectively, and further estimates a black 
character color and a ground color according to 
statistics of the brightness (S8) . Each of brightnesses 
is an average value of R, G and B signals obtained by an 
operation represented by an expression (1) below, for 

25 example. An average value and a standard deviation of 



the brightnesses are obtained from the brightnesses as 
the statistics of the brightness. 

(Expression 1) 

Brightness = (r+g+b) /3 

A tone correction part 5001 performs a tone 
correction of each of the pixels in each of the blocks 
according to saturation reference values calculated from 
the statistics of the brightness (S9) . In the present 
embodiment, the ground color is made white (maximum 
brightness) , and the black character color is made black 
(minimum brightness) . 

A corrected image obtained as above is 
output/transmitted from the personal computer 301 via 
the network interface 309 and so forth (step S10) . 

<2 . Details of Processing> 

Next, a description will be given of details 
of each of the above-described parts operated by the 
personal computer 301. FIG. 5 shows an example of the 
input image, i.e., the original image (Io)/ which 
includes Chinese characters. 

(1) Generation of low-resolution image (low- 
resolution image generation part 1001) 

The low-resolution image generation part 1001 
converts the original image (I 0 ) into the low-resolution 
image (I) of approximately 100 dpi so as to reduce an 
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amount of calculations. First, the low-resolution image 
generation part 1001 simply divides the input image (I 0 ) 
into blocks B not overlapping one another (see FIG. 6), 
and sets an average value of signals in each of the 
5 blocks B as a signal of a corresponding pixel in the 
low-resolution image. (I). In this course, supposing 
that a size of the block B, i.e., a reduction ratio, is 
r, the average value of the signals in the block B can 
be obtained by an operation represented by an expression 
10 (2). 

(Expression 2) 

r(i+l)-lr(y+l)-l 

I[i,j;k]= £ £ I 0 [m,n;k]/ (rxr) ( k=R, G, B) 

m=ri m-rj 

Besides, the reduction ration r is arranged to 
always become a natural number; when a resolution of the 

15 original image (10) cannot be divided by 100 without 

leaving a remainder, the reduction ration r is rounded 
into a natural number. For example, when the resolution 
of the original image (10) is 360 dpi, the reduction 
ration r is rounded into 3. 

20 Thus, using the low-resolution image (I) 

having the lower resolution than the original image (I 0 ) 
reduces the amount of the following calculations in the 
image processing, .thereby simplifying the processing. 
(2) Smoothing (smoothing part 2001) 



The smoothing part 2001 smoothes the low- 
resolution image (I) by using a linear filter so as to 
remove noises. 

(3) Calculation of feature values (feature- 
value calculation part 2002) 

The feature-value calculation part 2002 sets 
window of a fixed size around each pixel of the low- 
resolution image (I) , and calculates the average value 
and the standard deviation a of signals for each of 
channels of R, G and B. 

(4) Extraction of character area (character 
area extraction part 3001) 

The character area extraction part 3001 
extracts the character area C by performing a local 
adaptive binarization of each color component with 
respect to the low-resolution image (I). Upon 
extracting the character area C, the character area 
extraction part 3001 compares a signal value of each 
pixel with a threshold value ja(a+ba) having parameters 
and b; then, according to whether the signal value is 
higher or lower than the threshold value |j,(a+ba), the 
character area extraction part 3001 extracts the 
character area C. For example, in an image in which a 
black character is printed directly on a ground, there 
is a tendency of a contrast becoming strong at all of 
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the channels of R, G and B. Accordingly, in the present 
embodiment, a pixel (i, j) having a signal value lower 
than the threshold value n>(a+ba) at all of the channels 
is set as an element of the character area C (see 
expression (3) ) . 

(Expression 3) 

if J[i, j;R] < {a R +b R *a R ) /j r & X[i, j;G] < ( a G +bcrcr G ) ju G & 
J[i, j;B]< (a B +b B -cr B ) fi B 

Besides, the character area extraction part 
3001 forms a binary image having a same size as the 
original image (Iq) which binary image has an ON value 
at (an element of) the character area C, and has an OFF 
value at other area (see FIG. 6). The character area 
extraction part 3001 builds a transverse run along which 
background pixels (pixels of the OFF value) exist in the 
above-mentioned binary image. When a certain run has a 
length shorter than a predetermined threshold value t, 
the character area extraction part 3001 temporarily 
turns ON the pixels existing along the run. Thus, the 
character area extraction part 3001 performs a smearing 
of the run of the background pixels in a transverse 
direction, and similarly, performs a smearing of a run 
of background pixels in a longitudinal direction. Then, 
the character area extraction part 3001 obtains an AND 
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of the smearing of the transverse run of the background 
pixels and the smearing of the longitudinal run of the 
background pixels so as to set only pixels provided with 
the ON value by smearing in both directions as final 
5 foreground monochrome pixels (the character area C) . 
Thereby, an image I' including the character area C 
extracted as shown in FIG. 6 can be obtained from the 
original image (Io) as shown in FIG. 5. 

(5) Block division and class classification of 

10 character area (block division part, class 
classification part) 

The block division part divides the original 
image (I 0 ) into sufficiently small blocks B not 
overlapping one another (see FIG. 6). In this course, 

15 each of the blocks B may be provided with a size and a 
shape, such as a square with each side having a length 
of r equivalent to 20mm (160 pixels in 200 dpi, 320 
pixels in 400 dpi) . The class classification part 
classifies pixels belonging to the character area C into 

20 two classes according to brightness of each of the 
pixels in each of the divided blocks B. 

Upon classifying the pixels, the class 
classification part calculates the brightness from color 
signals of each of the pixels, and performs a threshold 

25 processing thereto. Known methods, such as a 
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discrimination analysis method, a moment preservation 
method, and an entropy method, can be used for the 
threshold processing, which is a known technology, and 
is not explained herein. In a block Bi, the class 
classification part classifies a group of darker pixels 
(each having a low brightness) into a first class R ilf 
and classifies a group of brighter pixels (each having a 
high brightness) into a second class R i2 . Since a 
brighter color is usually related to a background color 
of the character area, and a darker color is usually 
related to a character color, the first class R ix 
corresponds to a character, and the second class R± 2 
corresponds to a background (a ground) in the present 
embodiment. Thus, the class classification part 
classifies pixels belonging to the character area C into 
the first class R±i and the second class R i2 in all of 
the blocks B. 

(6) Estimation of black character color and 
ground color (black-character-color/ground-color 
statistical estimation part 4001: black-character- 
color/ground-color estimation part) 

The black- character-color /ground-color 
statistical estimation part 4001 selects a block B 
including a maximum number of pixels classified into the 
second class R i2 as a window W from all of the blocks in 
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which pixels belonging to the black character area C are 
classified into the first class R±i and the second class 
R i2 , and sets an average color of pixels (a group of 
pixels R w i) classified into the first class R±i in the 
5 selected window W as a black character color in the 
original image. FIG. 7 shows an area selected as the 
window W in the input image shown in FIG. 5, and groups 
R wi and R w2 of pixels therein. In FIG. 7, the black- 
character-color /ground-color statistical estimation part 

10 4001 sets the average color of the group of the pixels 
R wi as the color of a black character in the original 
image, and sets an average color of the group of the 
pixels R W 2 as a ground color in the original image. 

(7) Tone correction according to statistics of 

15 black character color and ground color (tone correction 
part 5001) 

The tone correction part 5001 calculates 
statistics (an average value and a standard deviation) 
of brightness in each of the groups R w i and R w2 , and 

20 performs a tone correction according to the statistics 
(li, a x , 1 2 , cr 2 ) regarding the black character color and 
the ground color of the input image which statistics are 
calculated in the window W. The statistics li and ai are 
the calculated average value and the standard deviation, 

25 respectively, of the brightness in the group R w i - The 
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statistics 1 2 and a 2 are the calculated average value and 
the standard deviation, respectively, of the brightness 
in the group R w2 . The average value li is smaller than 
the average value 1 2 - 

In a default tone correction performed by the 
tone correction part 5001, each of R, G and B components 
may be converted by a function that may shift the 
average value li to 0, and shift the average value 1 2 to 
255, as shown in expression (4) below. 

(Expression 4 ) 

r 

0 if x<l x 



255 (x-li) if li=<x=<2 2 



</ 2 -A) 

^ 255 if l 2 <x 
At this point, a range of an input signal 
value to be clipped to 0 or 255 needs to be adjusted 
according to characteristics of the image output 
apparatus. For example, in a printer having a high 
capability in representing a highlight color, a 
brightness value for an input signal to be clipped to 
white needs to be set low. On the other hand, in a 
printer having a low capability in representing a 
highlight color, a brightness value for an input signal 
to be clipped to white does not need to be set low. 

Besides, the tone correction of the black 
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character color and the ground color by the tone 
correction part 5001 needs to be performed in 
consideration of changes of the ground color in the 
input image. Therefore, a degree of the correction is 
5 adjusted according to the standard deviation 

(distribution) C7i of the black character color and the 
standard deviation (distribution) a 2 of the ground color 
calculated above. Specifically, a correction according 
to an operation represented by an expression (5) below 

10- may be performed (see FIG. 8). Besides, values of 

adjustment parameters a and b are specified by a user, 
and are suitably set according to characteristics of the 
printer used for printing, preferences of the user, and 
so forth. 

15 (Expression 5) 

r 

0 if x<2i+a-ai 



f(x)= < 



255 

(x-li-a-ai) if Ii+a-CTi=<x=<l 2 -b*a2 



<7 2 -b-a 2 -/, -a-a x ) 



^ 255 if l 2 -b-a 2 <x 

As shown in FIG. 8, larger values of the 
20 adjustment parameters a and b intensify an enhancement 
of the black character, a removal of the ground, and a 
contrast enhancement. FIG. 9 shows an image I" obtained 
by performing the tone correction to the input image 
(the original image (Io)) shown in FIG. 5. 
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By the way, since the tone correction depends 
on characteristics of the image output apparatus, the 
degree of the correction for the black character and the 
ground has to be able to be easily adjusted. For 
5 example, a printer having a high capability in 

representing a highlight color needs a setting in which 
a brightness value for an input signal to be clipped to 
white is set low; on the other hand, a printer having a 
low capability in representing a highlight color does 

10 not need such a setting. Further, the adjustment of the 
degree of the correction has to be performed in 
consideration of changes of the ground color in the 
input image. In order to achieve objects of a 
"comprehensive tone correction suitable for a document 

15 image, such as changing a ground into white and changing 
a black character into black while maintaining a 
gradation of a middle-tone area" and "parameters (the 
color of the ground or the black character) adaptively 
calculable from the image which parameters determine the 

20 processing", the black character color and the ground 
color need to be estimated correctly. 

In the present embodiment, according to the 
calculated distributions (the standard deviations) of 
the black character color and the ground color, a 

25 default tone correction is performed for example in 



which the average value of the ground color is shifted 
to white (maximum brightness) , and the average value of 
the black character color is shifted to black (minimum 
brightness) ; and upon adjusting the degree of the 
correction, the degree of the correction is shifted from 
the average values by products of the standard 
deviations and the adjustment parameters. Accordingly, 
a user can easily adjust the processing according to 
characteristics of the image output apparatus. 

According to the present embodiment, upon 
outputting the original image (I 0 ), which is a digital 
image input from the image input apparatus 4 01, from the 
image output apparatus 501, such as a printer, a tone 
correction is performed to the original image (I 0 ) 
according to a black character color and a ground color 
estimated from the original image (I 0 ) according to 
values calculated from feature values of the original 
image (I 0 ), without necessarily using predetermined 
parameters. Therefore, while a gradation of a middle- 
tone area in the original image (I 0 ) input from the 
various image input apparatus 401 is maintained, a 
contrast between the black character and the ground in 
an output image can be automatically adjusted, without a 
user performing an operation, such as setting a 
parameter . 



Besides, according to the present embodiment , 
based on results of the class classification in each of 
the blocks B, an average color of the group of pixels R wl 
classified into the first class R ±1 in the window W that 
is a block B including a maximum number of pixels 
classified into the second class R i2 is estimated as the 
black character color, and an average color of the group 
of pixels R w2 classified into the second class R i2 in the 
window W that is the same block B is estimated as the 
ground color. Thus, the black character color and the 
ground color can be estimated correctly. 

Next, a description will be given, with 
reference to FIG. 10 to FIG. 15, . of a second embodiment 
according to the present invention. Elements in the 
present second embodiment and other hereinafter- 
described embodiments that are identical or equivalent 
to the elements described in the foregoing first 
embodiment are referenced by the same reference marks, 
and will not be described in detail. 

FIG. 10 is a functional block diagram 
illustrating parts operated by the personal computer 301 
according to the image processing program in the present 
second embodiment. FIG. 11 is a flowchart showing a 
procedure of operating the parts included in the 
functional block diagram shown in FIG. 10. Upon 
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executing the image processing program, the personal 
computer 301 performs various steps shown in the 
flowchart of FIG. 11 by the parts (functions) shown in 
the functional block diagram of FIG. 10. 
5 In the present second embodiment, the low 

resolution processing part 2000 not only includes the 
smoothing part 2001 and the feature-value calculation 
part 2002, but also includes a character area extraction 
part 2003 (see the character area extraction part 3001 

10 in FIG. 3) performing a local adaptive binarization of 
each color component by performing a local adaptive 
threshold processing and an expansion processing to the 
low-resolution image (I) so as to perform an extractive 
detection of the character area C (shown in FIG. 6). 

15 Besides, in the present second embodiment, the 

low resolution processing part 2000 also includes a 
background color estimation part 2004 and a background 
area extraction part (a background area specification 
part) 2005, as described hereinbelow. 

20 Step SI to step S7 in FIG. 11 are performed in 

a similar manner as step SI to step S7 in FIG. 4. Then, 
the background color estimation part 2004 selects a 
window having a maximum value of an objective function 
determined according to a number of pixels classified 

25 into a brighter-color class and a brightness of the 



-38- 

brighter color, sets the brighter representative color 
in this window as an average color of a background in 
the input image, and further estimates a background 
color by calculating statistics of the brightness (S20). 
5 Then, the background area extraction part 2005 extracts 
(specifies) a background area according to the 
calculated statistics (S21) . Subsequently, the tone 
correction part 5001 performs a tone correction by 
replacing a color of the extracted (specified) 

10 background area with the estimated average color of the 
background, and preventing pixels other than the 
background from being gradationally discontinuous with 
the background area (S22) . Then, step S10 is performed. 

Next, a description will be given of details 

15 of each of the above-described parts operated by the 

personal computer 301 in the present second embodiment. 
FIG. 12 shows an example of the input image, i.e., an 
original image (Io')r in the present second embodiment. 
In the original image (I 0 ')/ compared with the original 

20 image (I 0 ) , an area forming a background part is 
composed of a plurality of background colors. 

(8) Estimation of color of background 
(background color estimation part 2004) 

The background color estimation part 2004 

25 defines an objective function based on a number N± of 



pixels classified into the second class R i2 and an 
average brightness L ± of the second class R i2 , and finds 
a block (a window) W in which the objective function has 
a maximum value. The objective function is defined, for 
example, such as NiXL ± , which has a larger value as the 
number Ni of pixels becomes larger and the average 
brightness Li becomes higher (see FIG. 7). In the 
present second embodiment, the background color 
estimation part 2004 sets an average color of a group of 
pixels R w2 (classified into the second class R i2 ) as a 
background color in the original image. Further, the 
background color estimation part 2004 calculates 
statistics (an average value 1 B and a standard deviation 
a) of brightness in the group R w2 . 

(9) Extraction (specification) of background 
area (background area extraction part 2005: background 
area specification part) 

The background area extraction part 2005 
extracts (specifies) a background area according to the 
statistics of the brightness of the background. In the 
present second embodiment, a pixel having a brightness L 
satisfying l B -aa<L<l B (wherein a is a positive parameter) 
in the low-resolution image (I) is regarded as the 
background area. The parameter a is determined 
according to changes of the background, a degree of a 
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show-through effect and so forth; for example, when the 
degree of the show-through effect is strong, the 
(adjustment) parameter a may be set large. FIG . 13 shows 
an image Ii' in which pixels extracted as the background 
5 are represented as black dots. 

As described above, in the present second 
embodiment, the background area is determined according 
to the average value 1 B and the standard deviation a of 
the brightness in the group R w2 . However, not limited 
10 thereto, by using a median m B and an a-percent quantile 
a B (e.g., a being 25) of a brightness distribution in 
the group R w2 instead, a pixel having a brightness L 
satisfying a B <L<m B may be extracted as the background 
area . 

15 (10) Tone correction according to statistics 

of color of background (tone correction part 5001) 

First, with respect to the original image 
(Io'), the tone correction part 5001 according to the 
present second embodiment replaces a color of each pixel 

20 in the background area with the average color of the 
background calculated in the block W. As for pixels 
other than the background, the tone correction part 5001 
converts each of R, G and B components by using an 
expression (6) shown below so as to prevent the pixels 

25 other than the background from being gradationally 
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15 



20 



discontinuous with the background (see FIG. 14). 
(Expression 6) 



r I 



B 



x if x<I B -aa 



f(x) = 

5 L 1b if l B -aa<x 

FIG. 15 shows an image I x " obtained by 

performing the tone correction to the input image (the 

original image (I 0 ')) shown in FIG. 12. 

As described above, in the present second 
10 embodiment, each of R, G and B components is converted 

according to the expression (6) . However, not limited 

thereto, each of R, G and B components may be converted 

by using an expression (7) shown below. 
(Expression 7) 

r 



— x if x<a B 

^ m B if a B <x 
Thus, in the present second embodiment, since 
a color document image is composed of two types of areas, 
i.e., a foreground and a background, from a local point 
of view, an issue of a statistical estimation of a 
background color is reduced to a simple two-class matter. 
Specifically, a partial area containing a true 
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background color is searched for from a digital color 
image, the area is classified into two classes of the 
foreground and the background, and a representative 
color and a variation of the background (a ground) are 
5 extracted from a color and a gradation distribution of 
the background. Therefore, a conventional analysis of a 
histogram comprising a mixture distribution can be 
avoided. 

Then, a background area is extracted from the 

10 image as a whole according to these statistics, and the 
background area is replaced with the estimated 
representative color, and further, pixels other than the 
background are subjected to a tone conversion so as to 
prevent the pixels other than the background from being 

15 gradationally discontinuous with the background area. 

Accordingly, a color of a background/ground 
can be automatically calculated from image 
characteristics by a statistical procedure without 
necessitating knowledge and characteristic information 

20 concerning the input apparatus or using predetermined 
parameters. Thus, an image deterioration originating 
from causes in a background part, such as changes of 
color of the background or a show-through effect, in an 
original image input from the various image input 

25 apparatus is corrected automatically without depending 
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on the knowledge and characteristic information 
concerning the input apparatus or using predetermined 
parameters, or without a user performing an operation, 
such as setting a parameter, so that the background part 
5 is made more appropriate, thereby obtaining an output 

image containing easily viewable characters and so forth. 
Besides, according to the present second embodiment, an 
image deterioration originating from causes in a 
background part, such as changes of color of the 

10 background or a show-through effect, in a digital image 
transmitted from a remote place via a network with an 
input apparatus being unknown is corrected automatically 
so as to make the background part more appropriate, 
thereby obtaining an output image containing easily 

15 viewable characters and so forth. 

Next, a description will be given, with 
reference to FIG.4 or FIG. 11, of a third embodiment 
according to the present invention. In the present 
third embodiment, the calculation of feature values in 

20 step S4 in FIG.4 or FIG. 11 and the extraction of a 
character area in step S5 in FIG.4 or FIG. 11 are 
performed according to edge amounts. 

When a black character is printed directly on 
a ground, there is a tendency of a contrast becoming 

25 strong at all of channels of R, G and B. 
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Accordingly, in the present third embodiment, 
a character area is extracted by using a minimum edge 
amount among edge amounts calculated at each of the 
channels of R, G and B as an edge amount in a subject 
5 pixel. 

Upon extracting the character area, first, an 
edge amount in each pixel is calculated, and when the 
edge amount calculated in each pixel is higher than a 
predetermined threshold value set beforehand prior to 

10 image processing, the pixel is set as an element of the 
character area C. Thereafter, smearing of a run of 
pixels is performed in the same manner as in the 
foregoing first embodiment . 

For example, an edge amount in a pixel (i, j) 

15 may be obtained as a maximum edge amount among edge 
amounts separately calculated for each of the three 
channels (R, G and B) . That is, the edge amount in the 
pixel (i, j) can be represented by an expression (8) 
shown below. Thus, a function of the feature-value 

20 calculation part is performed. 

(Expression 8 ) 

E 0 [i, j]=max{S j;R] , S [I; i, j;G] , S [I; i, j;B] } 
In the expression (8), S[I;i,j;k] represents 
an edge intensity in the pixel (i, j) at a k channel in 
25 the low-resolution image (I) . 
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By performing the threshold processing (thr) 
to the edge amount calculated as above, a pixel having 
an edge amount higher than the predetermined threshold 
value is specified. Accordingly, the pixel having the 
5 edge amount higher than the predetermined threshold 
value, and pixels around this pixel, are extracted as 
the character area C. Thus, a function of the character 
area extraction part is performed. Besides, the above- 
mentioned predetermined threshold value may be set 
10 beforehand as a fixed value, or may be set/changed from 
outside prior to the execution of the image processing 
program. 

According to the present third embodiment, a 
black character color and a ground color (a background 
15 color) can be estimated correctly. 

Next, a description will be given, with 
reference to FIG. 4 or FIG. 11, of a fourth embodiment 
according to the present invention. 

In the present fourth embodiment, the 
20 calculation of feature values in step S4 in FIG. 4 or 

FIG. 11 and the extraction of a character area in step S5 
in FIG. 4 or FIG. 11 are performed by using not only 
statistics in a window W but also edge amounts. 

Upon calculating feature values, statistics in 
25 the window W and an edge amount are calculated in the 
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same manner as in the foregoing first embodiment to the 
foregoing third embodiment. Thus, a function of the 
feature-value calculation part is performed. 

Subseguently, upon extracting a character area 
5 C, first, it is judged whether a signal value is lower 
at all of the channels than a threshold value calculated 
from the statistics in the window W, or whether the edge 
amount is higher than a predetermined threshold value. 
Then, a pixel judged to have a signal value lower at all 

10 of the channels than the threshold value calculated from 
the statistics in the window W, or to have an edge 
amount higher than the predetermined threshold value, 
and pixels around this pixel, are set as elements of the 
character area C. Thus, a function of the character 

15 area extraction part is performed. 

Thereafter, smearing of a run of pixels is 
performed in the same manner as in the foregoing first 
embodiment . 

According to the present fourth embodiment, a 
20 black character color and a ground color can be 
estimated correctly. 

Next, a description will be given, with 
reference to FIG. 4 or FIG. 11, of a fifth embodiment 
according to the present invention. 
25 In the present fifth embodiment, the 



generation of a low-resolution image (I) having a lower 
resolution than the original image (I 0 ) or (I 0 ' ) in step 
S2 in FIG. 4 or FIG. 11 is not performed, and the 
calculation of feature values in step S4 and the 
extraction of a character area in step S5 are performed 
with respect to the original image (I 0 ) or Uo'). 

According to the present fifth embodiment, the 
calculation of the feature values and the extraction of 
the character area can be performed with respect to the 
original image (I 0 ) or (I 0 ') without performing the 
generation of the low-resolution image (I) . 

Next, a description will be given, with 
reference to FIG. 4, of a sixth embodiment according to 
the present invention. In the present sixth embodiment, 
the estimation of a black character color in step S8 in 
FIG. 4 is performed in a different manner from the 
foregoing first and third to fifth embodiments. 

(6)' Estimation of black character color and 

ground color 

First, pixels belonging to the black character 
area C are classified into the first class Ru and the 
second class R i2 in all of the blocks B. Next, a block B 
Including a group of pixels R w i (classified into the 
first class Ru) and a group of pixels R W 2 (classified 
into the second class R i2 ) having a maximum difference in 
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average brightness therebetween is selected, and this 
block B is set as a window W. Then, an average color of 
the group of the pixels R w i in the window W is set as a 
black character color in the original image (I 0 ) , and an 
5 average color of the group of the pixels R W 2 is set as a 
ground color in the original image (I 0 ) - 

Besides, the estimation according to the 
present sixth embodiment may be similarly applied to the 
foregoing second embodiment and the following seventh 
10 embodiment each of which performs the estimation of a 
background color in step S20 in FIG. 11. 

Besides, in the present sixth embodiment, 
statistics used in tone correction are calculated in the 
window W. 

15 Next, a description will be given, with 

reference to FIG. 11, FIG. 16 and FIG. 17, of a seventh 
embodiment according to the present invention. In the 
present seventh embodiment, upon performing a tone 
correction in step S22 in FIG. 11, each of R, G and B 

20 components is not converted according to the function of 
the foregoing expression (6) (see FIG. 14), but is 
converted according to a function of an expression (9) 
shown below (see FIG. 16). 

(Expression 9) 
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r 255 



10 



15 



20 



x 



if x<l a -ao 



f(x) = 



255 if l B -aa<x 
Thus, by performing the tone correction in 
which each of R, G and B components is converted 
according to the function of the expression (9), an 
image I 2 " including a background part converted into 
white as shown in FIG. 17 can be obtained. 

As described above, in the present seventh 
embodiment, each of R, G and B components is converted 
according to the expression (9) . However, not limited 
thereto, each of R, G and B components may be converted 
by using an expression (10) shown below. 
(Expression 10) 

r 255 



if x<oc B 



f (x) = 



< 



255 if cc B <x 
Besides, the tone correction according to the 
present seventh embodiment may be applied to the 
foregoing second to sixth embodiments each of which 
performs a similar tone correction in step S22 in FIG. 11. 

The present invention is not limited to the 
specifically disclosed embodiments, and variations and 
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modif ications may be made without departing from the 
scope of the present invention. 

The present application is based on Japanese 
priority applications No. 2002-259618 filed on September 
5, 2002, No. 2002-280789 filed on September 26, 2002, No. 
2003-048834 filed on February 26, 2003, and No. 2003- 
163565 filed on June 9, 2003, the entire contents of 
which are hereby incorporated by reference. 



